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Are volatility estimators robust with respect 
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We consider microstructure as an arbitrary contamination of the underlying latent securities 
price, through a Markov kernel Q. Special cases include additive error, rounding and combina- 
tions thereof. Our main result is that, subject to smoothness conditions, the two scales realized 
volatility is robust to the form of contamination Q. To push the limits of our result, we show 
what happens for some models that involve rounding (which is not, of course, smooth) and see 
in this situation how the robustness deteriorates with decreasing smoothness. Our conclusion 
is that under reasonable smoothness, one does not need to consider too closely how the mi- 
crostructure is formed, while if severe non-smoothness is suspected, one needs to pay attention 
to the precise structure and also the use to which the estimator of volatility will be put. 

Keywords: bias correction; local time; market microstructure; martingale; measurement error; 
realized volatility; robustness; subsampling; two scales realized volatility (TSRV) 

1. Introduction 

Recent years have seen an explosion of literature on the problem of estimating integrated 
volatility and similar objects with the help of high-frequency data. For a sample of 
recent literature, see Hull and White [16], Jacod and Protter [18], Gallant et al. [11], 
Chernov and Ghyscls [7], Gloter [12], Andersen et al. [3], Dacorogna et al. [8], Barndorff- 
Nielsen and Shcphard [6] and Mykland and Zhang [22], among others. An important 
realization has been that log prices do not appear to be scmimartingalcs, but rather arc 
like semimartingalcs observed with error. The main hypothesis proposed in the literature 
is that this error occurs by rounding (Dclattre and Jacod [9]; Jacod [17]; Zeng [25]) 
or by additive error (Zhou [28]; Zhang et al. [27]; Zhang [26]; Ai't-Sahalia et al. [1]; 
Bandi and Russell [5]; Hansen and Lunde [14]). More complex (and descriptive) models 
for microstructure are also available; see, for example, Hasbrouck [15] and, from a very 
different perspective, Farmer et al. [10]. 

The multiplicity of ways in which errors can be modeled raises the question of how 
sensitive inference is to modeling assumptions. This is the topic of this paper. 
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We shall be making the assumption that there is a latent log price process X t that is 
a continuous semimartingale of the form 

dX t = fj, t dt + a t dB t , (1) 

where /x t and at are continuous random processes, at is nonzero and B t is a Brownian 
motion. This is also called an Ito process. Transactions at times = to < t\ < ■ ■ ■ < t n = T 
give rise to log prices Y ti that are contaminated versions of X ti as follows. We suppose 
that there is a family Q(x,dy) of conditional distributions so that, given X ti , the law of 
Y u is 

P(Y ti <y\X process) = P(Y ti <y\X u ) = Q(X ti ,y). (2) 

In other words, Y ti is distributed around X ti in a way that depends only on the latter. 
We also assume that Y to , . . ., Y tn arc conditionally independent given the X process. 

A simple example of such contamination Q is additive error on the log scale. If Y = 
X + e, where e has density g and is independent of X, then 

Q(x, dy) = g(y - x) dy. (3) 

Another example is rounding or truncation. In this case, the probability distribution 
Q(x,dy) represents a non-random distortion of x. We shall look at yet another form of 
contamination in Section 3. In that case, the distortion is a combination of additive error 
and rounding. 

This paper has two pieces of news: one good and one bad. We shall see that for 
reasonable types of contaminations Q, we can act as if the error is simply of additive 
type, and we shall see that the two scales realized volatility (TSRV) of Zhang et al. [27] 
is substantially robust to arbitrary contamination. This is our plan for Section 2. 

There arc, however, cases when we have to exercise care. We see one such case in 
Section 3, where we show that it is not always quite clear what is meant by volatility, 
and we have to consider carefully what quantity we actually wish to estimate. This occurs 
in cases that involve rounding. 

We are mainly using TSRV as an example of a volatility estimator, and we believe that 
similar conclusions will apply, for example, to the multiscale realized volatility (MSRV) 
of Zhang [26]. With caveats about additional bias and variance, similar conclusions will 
also apply to traditional realized volatility (RV). 

2. Robustness and smoothness of contamination 
2.1. Setup 

Suppose the latent log price process X follows (1). Let Y be the logarithm of the trans- 
action price, which is observed at times = to < t\ < ■ ■ ■ < t„ = T. We assume that at 



Are volatility estimators robust with respect to modeling assumptions? 603 

these sampling times, Y is related to the latent log price process X through (2). Let 
f(X t ) be the conditional expectation of Y t given the X process: 

f(X t )=E Q (Y t \X t ). (4) 

We assume that 

f(x) is twice continuously differentiable with f'(x) ^ Vx. (5) 



Definition 1. For two generic processes Z^' and and for an arbitrary grid H = 

{sq, si, S2j • • • j s m } of points in the interval [0, T], define 

m 

[z^\z^ = Y J {z^-zi%){z^-z^). 

Definition 2. If Z is a continuous semimartingale, its quadratic variation (Z, Z)t is 
defined as the limit in probability of [Z,Z\^ m if 7i m becomes dense in [0,T] as m— >oo. 
The quadratic variation is also known as the (integrated) volatility of Z for the time 
period [0, T] . 

The above definition gives a well-defined limit (Z, Z)t (independent of the sequence 
H m ) in view of Theorems 4.47 and 4.48 of Jacod and Shiryaev ([19], 52). 

A central problem is that we have two continuous semimartingale processes, X and 
f(X), that produce two volatilities: 

{X,X) T = I o-?dt and (f(X),f(X)) T = [ f'{X t ) 2 a 2 t dt 
Jo Jo 

(see Protter [23], Theorem 29, pages 75-76). An interesting question arises immediately: 
which volatility are the volatility estimators estimating? When we make use of the ob- 
servations Y ti to estimate the volatility, we might think that we are estimating (X,X)t, 
because Y ti is just the contaminated version of X ti , but given the X process, Y ti is 
centered at f(X ti ), rather than X ti . We note that because both X t and f(X t ) are Ito 
processes, without further model assumptions, we have nothing in the model that can 
answer the question of which volatility is the true underlying one. 

These two volatilities (X, X)t and (f(X),f(X))T are often similar quantities if /(x) f=a 
x, which makes the above question not so crucial, but this may not always be the case. 
Our first objective is to clarify which volatility the volatility estimators are estimating 
and how good the approximations are. 

TSRV is a typical example of volatility estimators. For the moment, we focus on 
determining the properties of TSRV. We make use of some of the notations from Zhang 
et al. [27]: 
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Let Q = {0 = to,iiji2) ■ ■ ■ ,tn = T} be the grid that contains all the observation times. 
We suppose Q is partitioned into K non-overlapping subgrids Q^ k \ k = 1, ...,K. As 
introduced in Zhang et al. [27], a typical example of selecting the subgrids is to use the 
regular allocation 

g( k *> = {tk-l,tk-l+K,tk-l+2K, ■ ■ ■ ,tk-l+n k K}- 

Let rifc = |{?^|, the integer making t).-i+n k K the last element in Q^ k \ and let 
n = ^Ef=i«fe = y(» - K + 1). Further define [Z«,Z( 2 )]<£ U) = [Z«,Z( 2 )]g and 
[Z«,Z( 2 )]Cf s) = ^L^ (1 U (2) ]r for the two processes Z« and Z< 2 >. 

Estimators of the form [Y, Y]l} with H C are usually known as the RV. The TSRV 
is given by 

(X^) T ^[Y,Yp s) -^[Y,Yp l \ (6) 

We assume constant step size (Aij = T/n) and that as n — > oo, 

if— > oo and n/K — > oo. (7) 

Note that our results generalize quite predictably if we allow Aij to vary (see the theory 
in Zhang et a/. [27]). 

We also assume that the filtration for (X t ) satisfies the "usual conditions" (see, e.g., 
Definition f.3 of Jacod and Shiryaev [19], page 2) and Condition E of Zhang et al. [27]. 



2.2. Estimators of volatilities — estimators of (f(X) t f(X)) T 

Denote 

e u =Y u -f{X u ). (8) 

Note that under (2), the conditional moments of depend only on the value of X ti . 
We assume that the conditional second moment of is continuous and that there exists 
S > such that the conditional (4 + 2(5 )th moment of e ti is bounded on compact sets; 
that is, 

g(x) := E(e 2 . \X ti = x) is continuous, (9) 
VZ>0 > 3M (4 +a« 0l o. s± V{\e u \ i+2S «\X U = x) <M (i+25oJ) , when x e [-/, I]. (10) 
We shall need the concept of stable convergence, as follows. 

Definition 3. Consider the a -field S = a(X s ,0 < s < T). H^e say £/ia£ a sequence £„ 
converges stably to £ provided, for all F 63 and aZ/ bounded continuous g, ~ElFg(£ n ) — > 
E/f<7(£) as n — * oo, where ( is defined on an extension of the original space. 
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Note that because X is continuous, the stable convergence of a sequence is equivalent 
to the joint convergence of £ n with the process X s , < s < T (see Jacod and Protter [18]; 
Section 2). This would not have been the case if X were discontinuous. Also, note that 
we are using a specific reference er-field 2, which is a little different from standard usage. 

Theorem 1. When we take K = cn 2 ^ 3 (the best possible order of TSRV), under the 
setup assumptions in Section 2.1, suppose (9) and (10) are satisfied, as n—>oo, 

n^{(xX) T (f(x), f{x)) T ) S (J^ £ g (x t f dt + < 2 t) V2 iv(o, 1) 

stably, where 

( 2 = l f \f'{X t )a t fdt. (11) 
Jo 

It is clear from this result what changes and what does not change for this more 
general contamination, compared to the case of independent additive error studied by 
Zhang et at [27]. 

• The volatility being estimated is that of f(X t ). (In Zhang et al. [27], f(x) = x.) 

• The rate of convergence n 1 / 6 is the same as for independent additive error. 

• The asymptotic variance changes to reflect the more complex form of contamination. 

In summary, if we are happy to estimate the volatility of f(X t ), the TSRV is exceed- 
ingly robust. The point about asymptotic variance is an issue only if we wish to set an 
interval around the observation. As can be seen from Zhang et al. [27], this is difficult 
even with straight additive contamination. 

2.3. Proof of Theorem 1 

We need to do some preparations before proving Theorem 1. 

First, note that under the assumption (10), the following statements are true: 

V6><4 + 2(5o V{\e u \ e \X u =x) is bounded on [-1,1] (12) 

(we write the bound as Mrgn) and 

Var(4 \X t . =x) = E(4 \X U = x) - E 2 (e 2 u \X U = x) is bounded on [-1, 1] (13) 

(say, by M( Var n). We shall use these notations in the proof: 

M« = \ £(4-E(4|X)); 
Vn ueG 
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a4 2) = 



where denotes the previous element in C/( fc ) when tj € C/( fc ). £t_ x = 0, £t 4 _ = for 
U =min^( fe ). 

Proposition 1. Assume that F,(\A n \\X) is Op(l). Then A n is Op(l). 
Proof. We have 



P(\A n \ > K) < P(\A n \I mAn]]x) < K , } > K) + P(E(\A n \\X) > K') 

< E (l A »lWll*)<^ } ) + P(E(\A n \\X) > K') 
K 

~E(E,{\A n \\X) I A n \\x)<K'}) 



K 

<^+P(E(\A n \\X)>K') 



P(E(\A n \\X)>K') 



for all K,K'. Hence the result follows. □ 
Lemma 1. We have 

[y,y^ aU) = [ £ ,4 aU) + 0p(i), (w) 

[Y,Y]^ = [ e , e ]£*> + [f(X)J(X)]^+0 P (-±=y (15) 

Proof. Define r; = inf{i : \X t \ > I}, VZ. Note that T\ has the property that 

P(n<T)^0 asZ^oo. (16) 



Also define Af(X u )) = f(X u+1 ) - f(X u ) for i = 0, 1, . . . , n - 1 

V (2,0 



By (12), g(X t ) = E(s%\X t ),t <T is bounded by Mt 2 I) on {r ; >T}; that is, 



E(([/(X),4 aU) ) 2 / {r;>T} |X) 

n-l 

= ^ >t } E ( A /(*t,-J - A/(X ti )) 2 E(4 \X) + AfiX^fEiel \X) 

i=l 

+ Af(X to fE(et\X) 
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n-l 



; I {TI>T} M {2>1) ^(A/(X ti _J - Af(X u )f + A/(X t „_0 2 + Af(X to y< 

_i=l 

= 2/ {T;>T} A./ (2! ^[/(X)J(X)]^ 1) -£ 1 A/(X fi _ 1 )A/(X ti )^ 

<AI {Tl>T} M (U) [f{X) 1 .f{X)] ( f ) 
= O p (1), 

where we have used the Cauchy-Schwarz inequality. Hence, by Proposition 1 and (16), 

[/(X),4 all) =0 P (l). (17) 
Parallel argument shows that E(([/(X), e]^ vg) ) 2 / {Ti>T} \X) = O p (±). Hence, 

lf(X),ep^=0 P (^=). (18) 



'K 

Equalities (17) and (18) imply (14) and (15) because 

[Y,Y]^ = [eAf^ + {f(X)J(X)} ( < dl) +2{f(X),e}^ 

and 

[Y,Y]^ = [f(X),f(X)p s) + [e,sp s) +2[f(X),e} { T avs) . D 

(2) (3) 

Lemma 2. Mj, and M T are asymptotically independently normal conditionally on X , 
both with variance ^ g(X t ) 2 dt. 

Proof. We use (v)t to denote the discrete-time predictable quadratic variations and 
covariations (see Hall and Heyde [13], page 51) in this proof. Note that they are different 
from the continuous time quadratic variations in Definition 2. □ 

(2) (3) 

' and Mj, ' are the end-points of martingales with respect to filtration Ti = cr(et -, 
j<i,X t ,all t): 



( M (2) ;M (2) )t = I \p VarCe^.J^) 
= i^4_ l5 (X tj ) 

ties 

- E (4-, - g(x t ,_ 1 ))g(x u ) + 1 J2 9(Xt^)9(X ti )At. (19) 
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{n>T} 



= Var ( i E(4-. - 9{X u ^))g{X u )I {Tl>T] \X J 
V ties / 

= ^ E Var(4 jX) ff 2 (X ti )/ {Tl>T} 
ties 

^ i E *W)M (2iI) 



1 

n 



hence, by Proposition 1 and (16), the first term of (19) - Y^t&gi £ U-i — ffP^t.-i ))9(Xti ) — >p 
0. Therefore, by (9), 

(m( 2 ),m( 2 )) t = I ^ 9{Xu^)g{X u )At + o p {l) - P - f g(X t fdt. 
Parallel argument shows that 

(M< 3 >,M< 3 >) T = -£ Var( £ti£ti> _|^_x)^pi f g(X t ) 2 dt. 



On the other hand 

K 



K K 



As a consequence, 

E(((Af( 2 ),M( 3 )) T ) 2 / {Tl>T} |X) E 2 (4|X)E( £ t i 4 _/ {T;>T} |X) 



fc=i t 4 egw 

A" 



^E E E 2 ( £ 2 |X) v /E(4_ i / {Ti>T} |X)E(4_/ {Ti>T} |X) 



^ - M (!*) M (4,0 
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By Proposition 1 and (16), (M^,M( 3 )) T ->p 0. 

By assumption (10) and Proposition 1, we can easily see that the conditional Lya- 
punov conditions are satisfied. Also note that the limiting predictable quadratic vari- 
ation y g(X t ) 2 dt and the limiting predictable quadratic covariation (which is zero) 
are measurable in (the completions ol) all the er-ficlds Ti, so we can make use of the 
Remarks immediately following Corollary 3.1 (the martingale central limit theorem) in 
Hall and Heyde [13] to obtain the conclusion. 

Proof of Theorem 1. Denote 

R X = (4 - E« \X)) + (el E(4JX)) 

and 

K 

R 1 = X][( e minS«=> ~ E ( £ mmSCO \ X )) + ( £ LgW ~ E ( £ ma X S«0 1^0)]- 
k=l 

By (13), E(i?fJ {n>T} |X) = Op(l); hence, i?i = O p (l) by Proposition 1 and (16). Simi- 
larly, E(i?2/ Ti >T|A) = O p (K); hence, R 2 = O p (K x / 2 ). As a consequence, 

[e, 4 all) = 2 £ ( £ £ - E(4 |Jf )) - 2 £ + 2 J2 E « I*) ^ 

ti€G ti>0 ti6S 

- 2^(M (1) - M (2) ) + 2 ^ E (4l X ) + °^(!) 

and 

is - 

X[ e ,e](? vg) = 2V^(M« - M< 3 >) - i? 2 + 2J2 E E (4l X ) 

^2^{M {1) -M^) + Op{K 1 I 2 ) + 2Y^ J2 E (4l^)' 
Therefore, conditionally on the X process, 

4(Mr s) -^4 aI1) ) - ^(K[ £ ,4 avs) -[ £ ,4 all) ) 

= {2{M^ -M^))+0 P 

^ N (°4l T 9{Xt)2dt )- (2o) 
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Observe that 



{x,X) T = [Y,Yp &) - ^[r,y]^ all) 

= [f{ X)J(X)]^ + [ £ ,e]^+O p (-L 

[e, elm 11 ^ — O p ( — ) (by Lemma 1 

n \ n J 

= [f(X)J(X)]^ + [e,e] { r VS) - 1\zA { t X } + O p (^ 
and note that ~ Jku ~ V™^ (^0), conditionally on the X process 
^((X^ T -[f(X)J(X)]^)^N(o,^ £ g(X t ) 2 dt\ 



(21) 



On the other hand, f(X t ) is a semimartingale, df(X t ) = (f'(X t )n t + \f"{X t )o 2 )dt + 
f(X t )a t dB t . By Zhang et al. [27], 

^([/(X), /(X)]^ Vg) - (/(X), /(X)) T ) ^> • discrete, (22) 

where £ is defined as in (11) and ^discrete ~ X(0, 1) is independent of the process X. The 
convergence in law is stable. 

Combining (21) and (22), we have 

(XX) T - (f(X), f(X)) T = ((XX) T - [f(X),f(X)p s) ) 

+ ([f(X)J(X)}^ ) (f(X)J(X)) T ) 

5.V2 



The error is minimized when K = 0(n 2//3 ). If we take K = cn 2 ^ 3 , we have [by exploiting 
the conditional convergence in (21)] 



n^ e ({X,X) T -{f(X),f(X)) T )^c j\{X t ) 2 At + ceT^ ' #(0,1) stably. n 
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random 
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> 




rounding 




efficient price price with random error observed price 

Figure 1. Two stage contamination: random error followed by rounding. 



3. A case study 

3.1. Another form of contamination 

Two types of errors — additive errors and rounding errors — have been proposed as can- 
didates of market microstructure errors. Results of when one of them plays the role are 
available. Now, we consider the case when both types of errors are present. 

Suppose at the transaction times, the latent return process X ti is contaminated by an 
independent additive error process r\ti and then rounded to reflect that prices are quotes 
on a grid (typically in multiples of one cent). We thus envision a two stage procedure 
where a latent efficient price S = cxp(X) is first subjected to multiplicative random 
error: S — Scxp(r/). The actual price S is then the rounded value of S. If we take, as 
usual, Y = logS", our final product is the observed process Y ti of logarithms of rounded 
contaminated prices 



Tog((exp(X tj 

where s 1 --^ = a[s/a] is the value of s rounded to the nearest multiple of a. The model is 
somewhat similar to that used by Large [21]. It can be illustrated as in Figure 1. 

For practical purposes, we further assume that the smallest observation of the security 
price is a, which makes observations of the log prices have the form 



Y u =logaVlog((exp(X t 



(23) 



We consider the case when the random errors are independent identically distributed 
normal random variables, with mean and positive variance; that is, 



r\ U ~i.i.d.^V(0, 7 2 ), 7>0. 



(24) 



In this case, 



f(x)=E(Y ti \X ti =x) 

poo 2 



27T7 



(logaVlog((e 2 ) (Q) ))e- (z - 3;) <^ } dz 



(25) 
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is a twice continuously differentiable function with positive first derivative, and the as- 
sumptions (9) and (10) hold. Therefore, by Theorem 1, the TSRV is a robust estimator 



In this study we assume that a is a fixed quantity that is independent of the number 
of observations. Note that in the case where a — > 0, we can expect relatively well posed 
behavior in view of Kolassa and McCullagh [20] and Delattre and Jacod [9] . 

3.2. Robustness works: When 7 is big 

Assume that the latent price process S = exp(X t ) has a small probability of going below 
a for t <E [0,T]. Then, under model (23) and assumption (24), we have 

f(X t )Ks X t and f'(X t ) « 1 for * G [0, T], for suitably big 7s. 

By 'suitably big 7s,' we mean that the size of the random error is large enough that 
the possibility that it pulls the observations of the prices up or down several grid points 
(multiples of a) is not negligible. In this case, when taking the conditional expectation, 
the positive and negative errors cancel out, and this leads to the result that f{X t ) w X t . 

In this case, {f(X)J(X)) T w (X,X) T . Therefore, the TSRV, which is a robust esti- 
mator of (f(X),f(X))T, is a good estimator of (X,X)t as well. 

These relationships are illustrated in Section 3.4. 

3.3. How things can go wrong: When 7 — ► 

When 7 is small but not 0, by Theorem 1, we know that the TSRV goes to the limit 
(f(X),f(X))T robustly. However, this volatility (f(X),f(X))T is no longer close to 



To study the limiting behavior of (f(X),f(X))T, we relate it to the local time L^ of 
the semimartingalc X (for a definition of the local time, see Revuz and Yor [24], page 
222). By Corollary 1.6 of Revuz and Yor ([24], page 224), for any positive Borel function 
$, almost surely, 



Jo J — oo 

By Exercise (1-32) of Revuz and Yor ([24], page 237), for the process X given by (1), the 
family L a may be chosen such that 



of (f(X)J(X)) T . 



(X,X) T . 




(26) 



LI is continuous in a almost surely. 



(27) 



We shall consider only the version of the local time Lf that satisfies the condition (27) . 
Relating (/(Y),/(Y))t to Lf. by (26), we obtain the following result: 
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Theorem 2. As 7 — * 0, for the process X given by (1) and f defined as in (25), almost 
surely, 



where Lf is the local time of the continuous semimartingale X . 

In other words, the 'target' (f(X),f(X))T that we are estimating blows up as 7 goes 
to zero and is of order 1/7. This fact raises questions of whether (f(X ), /(X))y is, in 
this case, the quantity that we are really seeking. 

3.4. Illustration 

We take a typical sample path to illustrate the situation: suppose the latent log price 
process Xt follows (1) with fXt — and a t = 0.2, Vt £ [0,oo) (these arc the annualized 
parameters). Suppose at observation time ti, the price exp(A ti ) first is contaminated 
by an independent multiplicative random error exp(?7 ti ) with rj ti independent identically 
distributed as iV(0,7 2 ) and then is rounded to the nearest multiple of a = 0.01 (one 
cent). The quantity of interest is the volatility of the process over time t £ [0,T] with 
T = 1/252 (i.e., one day). We assume that a day consists of 6.5 hours of open trading 
and that the price process is observed once every second (n = 23400). 

A sample path of the latent log price process X t ,t £ [0,T], is plotted in Figure 2, 
together with its corresponding pure rounded process (the solid line) and two f(X t ) 
processes [see (25)] with 7 = 0.001 (asterisks) and 7 = 0.005 (open circles), respectively. 

Figure 3 records the TSRV of this particular sample path X t ,t £ [0,T], in Figure 2, 
with random contaminations of different sizes (with standard error 7 ranges from 0.0002 
to 0.006). The solid line is the volatility (X,X) T - 

We see from Figure 2 that for this process, when 7 is as large as 0.005, the f(X) is close 
to X, while when 7 is smaller, the process f{X) diverges from X; in fact, it goes closer to 
the (discontinuous) pure rounded process. Figure 3 shows that when 7 is suitably large, 
the TSRV can be a good estimator of (X,X)t, but when 7 is too small, the estimator 
does not estimate (X,X)t, but rather a much larger quantity. Note that although it is 
similarly shaped, this graph is not a signature plot in the sense of Andersen et al. [2], 
because the horizontal axis represents 7 rather than sample size. There is, however, a 
connection between these two types of plots, as shown in equation (29) below. 

3.5. How error and sample size relate to each other — comparison 
to the case when 7 = 

When 7 = 0, the additive error is gone, only the rounding error is present. In this case, 
the observations are themselves the conditional expectations and fix) is no longer con- 
tinuous: 




Y u = f{X ti ) = E(Y U \X U ) = log a V log((exp(A ti ))(»>). 



(28) 




Recall the notation from Section 2.1 about the TSRV (X,X)t- In particular, K is 
the number of subgrids and n = -h{n — K + 1) is the average number of elements in 
the subgrids. Also recall assumption (7), which is equivalent to 'n — > oo and h/n — > as 
n^t co.' A modification of the Jacod [17] proof gives the following result: 

Theorem 3. When 7 = 0, if X — aW where a > and W is a standard Brownian 
motion, we have 

V 1 lv~~v\ 1 rlog((fc+l/2)a) f, fc+lA 

phm^= (X,X ) T = — t=\/- V ^ T ( s — j - ) ' 

where Lf is as in Theorem 2. 

We can see from Theorems 2 and 3 that to first order. 

TSRV under pure rounding and no contamination 
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<X,X>_T 
'o': TSRV 



□ o 
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Figure 3. TSRV vs size of the random contamination, based on one (random) latent log price 
process ({X,X) T = (0.2) 2 /252 w 1.59 ■ 10" 4 ). For more details of the simulation, please refer to 
Section 3.4. 



8n7 2 

— ^— x TSRV under rounding after contamination of size 7. (29) 
a T 

Thus, in a sense, contamination plays a role slightly similar to sample size when there is 
no contamination: 



7 2 under random contamination w n— 5 — under no random contamination. (30) 

In both cases, the sizes of of 7~ 2 and n have similar functions in quantifying the ill— 
poscdness of the respective estimation problems. The deeper meaning of this remains, 
for the moment, a little mysterious, even to us. 
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3.6. Proofs of Theorem 2 and Theorem 3 

Proof of Theorem 2. We have, by (26), for /x(da) = /z 7 (da) = (/'(a)) 2 da, almost surely, 

pT poo 

(f(X),f(X)) T = {f{X t )fd{X 1 X) t = L^(da). 



Recall that f(x) = -^-(loga V log((e*)( a) ))e- (( *- x)2/(272)) dz; hence, 

/OO -i 
-^(loga V log((e z )M))^e-« z -^ 2 /^ 2 )) dz 

- log a V log(exp(x + -fv) ( - a ^)—^=e~ v2/2 dv. 
, 7 V27T 



For k = 1,2, 3, . . ., we have, Vy E R, 
7/'(log(u+-JaJ +2/7 



y log a V log ^exp ^log ( ( k + ^ j aj + y~f + «7^ ~\ J^— e " ^ 2 ^ w 



■ v), y~JV(o,i). 



By the dominated convergence theorem, 



E y (log((fc + l)a) • y/ {y+ v >0} ) + E y (log(/ca) ■ FI {j , +y<0} ) 



1 /* 1 
dog((fc + l)a)-= / ze- 22/2 dz + log(fca)^= / ze~ z2/2 dz 



e-^logf^±l). (31) 



For fc = 1,2,3,..., denote x fe = log((fc + |)a). For any c5 S (0,log(fc + |)) and 

^ 6 (0, log (fe + §)/(*; + !)), 



fa: fe +i5i 

7/ti[xfe -5o,arjfe + di] = / l{f'(x)) 2 dx 



(7/'(^fe + 2/7)) 2 ^{ye[-5 /7^i/7]} d y- 
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By the dominated convergence theorem, 



lim jfi[x k - S ,x k + 8i] 
7— >o 



lim(7/'(x fc +2/7)) I {ye[ _ 5o/jiSl/j]} dy 

-00 7->0 



20? 



1 . fc + 1 

dOg— ; 



log 



2tt 
fc+1 



e y dy 



(32) 



As a consequence, for any 6' , S[ £ (0, \ log(fc + |)/(fc + 5)), 



lim 7/x[x/c + 5q, Xfc+i - S[] = 0. 
7— >o 



(33) 



A simpler argument shows that for any a < x\ and S £ (0,xi — a), 



lim 7/x[a, xi — S] = 0. 

7^0 



(34) 



Define v to be the finite measure on R that has point mass ^^(log^T") 2 on Xk ' 

k = 1, 2, 3, For any continuous function </> that vanishes outside a compact set, suppose 

that the support of is in [— C, C] and that \<j>\ is bounded by M. Denote kc = L7; — 5J 1 
the largest integer fc such that log((fc + \)a) < C. For small 5 > 0, 



/oo °° 



< 



fee 

E 



fc=i 



0(a)7/i(da) 



CCl — 5 



<^(a)7M(da) 



E/ ^( a )7M(da) -^^(zfeM^ 

fe=1 Jx fc -<5 fc=1 

it 



< M^2(j{i[x k +S,x k+1 -5] + ~/(j l [-C,xi -6}) 



k=l 



fee pxk+b /. 

0(a)7Md«)-E/ 



fee r x h +S 



(f>(x k )jn{da) 



fee 



k=1 Jx k -5 
ka 



^ / 0(x fc )7^(da) - 4>(x k )v{x k ) 

k=1 Jx k -S k=1 



618 Y. Li and P. A. Mykland 

As 7^0, the first term above goes to zero by (33) and (34); the second term can be 
arbitrarily small by letting S be small; the third term goes to zero by (32). Hence, 



lim / 0(a)7/i(da) 



^HH»)('^) ! 



In particular, for any u> G O such that L° is continuous in a, is a continuous function 
of a that has compact support. Therefore, by (27), almost surely, 

lim 7 </(X),/(X)>T=li^ n 

Proof of Theorem 3. We borrow the notations from Jacod [17]: For ti = —,i = 

0,1, ...,71, 

e? := X 4i - X tj _ l5 x™ := (f(X u ) - /(Xt^)) 2 , where /(*) is defined in (28), 
R» ■= {(x, y) : f(x) = log(fca), /(») = log((* + l)a) or 
/(*) = log((fc + l)a), f(y) = log(fca)}, 

OO 

i?„ := |J JZ*, 5„:=R 2 \i?„, T(a) := {(x, y) : x < a < y or y < a < x}, 

k=l 

oo -. n 

T n := MT(log((fc + l/2)a)), W n := -= V / fl „ (X u _, , X t 
k=i 

n 

If (x, y) G S„, then either /(x) = f(y), or 

|exp(/(x))-exp(/(y))|>a. (35) 

In the case of (35), without lost of generality, we can assume exp(x) > exp(y) > a. Then 

,,, ■. ■., . (exp(a;))( Q ) cxp{x) + a/2 

f (x -/ (y =log- ——r <log — — 

(exp(y))w exp(y)-a/2 

exp(o;) + exp(a;)/2 , . , . 

- lo s — n rr^T = lo s 3 + F-y ■ 36 ) 

exp(y) - exp(y)/2 

If, in addition, we have that both exp(x) and exp(y) are bounded by M > 0, then there 
exists 9, (x A y) < 9 < (x V y) such that a < | exp(x) — exp(j/)| = exp(#)|.T — y\ < M\x — y\. 
This implies that \x-y\>jfc; hence, log 3 < MMl^ - y|. By (36), 

l/(«)-/(y)|<(^^ + i)N-y|. (37) 
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It is easy to see that (37) holds for all (x, y) £ S n such that exp(cc) and exp(y) are bounded 
by M. Therefore, 

n 1 

E(^ {TlogM >T}) = E^ £; (^^(^-i:^)WM>T}) 
i=l V 

^ i /Miogs^y^ 

which implies (also by making use of (16)) 

W n ^pO asmoo. (38) 

On the other hand, for i = 1, 2, . . . , n, 

E(I{| expCXtJ-exppf^jl^a}) < E(exp(X (iT)/ „) - exp(X ((i _ 1)T)/ „)) 2 / a 2 

= (Eexp(2crM / lT/n ) + Eexp(2crt¥ ((i _i )T)/n ) 

-2Ecxp(2 ( rVF ((l _ 1)T)/ „)Ecxp(aVK T/ „))/a 2 

1 (2cr 2 {i-l)T\( /2a 2 T\ , „ /a 2 T 
= CXP ( n j ( CXP (— j + 1 - 2 CXP 

1 2 ^ (2cr 2 T) fc - 2(<r 2 T/2) fc 



< exp(2a 2 T)£. 



a 2 1 v y ^-J (kln k ) 

, 1 /cxp(2cr 2 T)(exp(2cr 2 r) -cxp(cr 2 r/2)) 
— n \ a 2 



which implies 



1 " 

— /~,-f{|exp(Xt.)-exp(X t . _,)!>«} ~*P as 71^00. (39) 
v n i= i 

Note also that if (X ti _ 15 X 4 J G iJ*, then x" = (log^-) 2 . We have, for fc ; = - |J 
(the greatest integer fe such that log((fc + \)ct) < I), 

-L[Y,Y]^I {n>T} = (wnf^In.JX^X^x? + W n jl {Tl>T} 

= (^(Wni^x^x^-^og^) +w r )i {Tl>T} .m 
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We have R n C T n and | exp(x) — exp(y)| > a when (x, y) &T n \ R n . By (38), (39) and 
(40), we see that, for W' n ^ELi^t^^t,), 

P iim^[y,y]( all >/ {ri>T} 

n — too V Tl 



plim ^ W' n lT(\og((k+l/2)a)) (Xt^Xu) log — - 7 



fc=l 



fc+1 



<{r,>T} 



1 / fc + 1 \ 

= plim ]T -= J2 ^T(log((fc+l/2)a)) (Xt^x , X U ) { fog — — J I {n> T} 

J^r/ k + 1 y i ™ 



((k+l/2)a))(X tz _ 11 X U ) 



I {n>T}i 



and for fc = 1, 2, 3, . . . , by Azai's [4] and Jacod [17], 



1 T (Y Y ^ 2 1 , P r log((fc+l/2)a) 

2^ 7 T(log((fc+l/2)a)) (Af,^ , A t J -> — = y-L 

1 ^ 



Note that i can be chosen to be arbitrarily large in the above argument. Therefore, by 
(16), we have 



plim J-[Y,Y}^ = -±=J- VL T 



log((fc+l/2)a) 



log- 



fc+1 



(41) 



and 



(iVT V 7T 



fe=l 



log 



fc + 1 



Applying these results to the TSRV (6), noting that (41) implies ^[Y",Y"]( a11 ) 
by assumption (7), yields 



-±=(X,X) T = ^=[[Y,Y]^-^[Y,Y} T 



1 2 



fe=i 



log((fc+l/2)a) 



log- 



fc + 1 



□ 



4. Conclusion 



We have shown in this paper that the robustness of the two scales realized volatility 
(TSRV) depends crucially on the deterministic part of the distortion through the function 
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/ defined in (4). On the other hand, in terms of consistency and order of convergence, 
the TSRV is always robust to the random part of the error (Y — f(X)). In Section 3, we 
have studied a particular model of contamination, involving random error followed by 
rounding, and we have seen that, in this case, depending on parameters, the non-random 
distortion can be benign or problematic. 

A lesson from our study is that there are really two candidates for the term 'volatility' 
namely (X,X)t and (f(X),f(X))T, and that in some cases they can diverge substan- 
tially. Further investigation of what quantity one wishes to estimate necessitates more 
research into the use of realized volatility estimates in such applications as portfolio 
management, options trading and forecasting. 
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